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Abstract. We study representations of ideal languages by means of 
strongly connected synchronizing automata. For every finitely generated 
ideal language L we construct such an automaton with at most 2" states, 
where n is the maximal length of words in L. Our constructions are based 
on the De Bruijn graph. 
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1 Introduction 

Let £^ = {Q^U,d) be a deterministic finite automaton (DFA for short), where 
Q is the state set, S stands for the input alphabet, and S : Q x S ^ Q is the 
transition function defining an action of the letters in S on Q. When 6 is clear 
from the context, we will write q . w instead of 6{q, w) for q € Q and w € S* . 

A DFA £/ — {Q, U, S) is called synchronizing if there exists a word w E S* 
which leaves the automaton in unique state no matter at which state in Q it is 
applied: q.w = q' .w for all q, q' £ Q. Any word w with such property is said to be 
synchronizing (or reset) word for the DFA £/. For the last 50 years synchronizing 
automata received a great deal of attention. For a brief introduction to the theory 
of synchronizing automata we refer the reader to the recent surveys [71|5] . 

In the present paper we focus on language theoretic aspects of the theory of 
synchronizing automata. We denote by Syn(^) the language of synchronizing 
words for a given automaton £/. It is well known that Syn(^) is regular j8j. Fur- 
thermore, it is an ideal in S*, i.e. Syn(j2/) = S* Syn{s:/)U* . On the other hand, 
every ideal language L serves as a language of synchronizing words for some au- 
tomaton. For instance, the minimal automaton of the language L is synchronized 
by L [4]. Thus, synchronizing automata can be considered as a special represen- 
tation of ideal languages. Effectiveness of such representation was addressed 
in [3]. The reset complexity rc{L) of an ideal language L is the minimal possible 
number of states in a synchronizing automaton £/ such that Syn(^) = L. Ev- 
ery such automaton £/ is called minimal synchronizing automaton (for brevity, 
MSA). Let sc{L) be the number of states in the minimal automaton recognizing 
L. Then for every ideal language L we have rc{L) < sc(L j^. Moreover, there are 
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languages L„ for every n > 3 such that rc{Ln) — n and sc(L„) = 2" — n, see [4]. 
Thus, representation of an ideal language by means of a synchronizing automa- 
ton can be exponentially smaller than "traditional" representation via minimal 
automaton. However, no reasonable algorithm is known for computing MSA of 
a given language. One of the obstacles is that MSA is not uniquely defined. For 
instance, there is a language with at least two different MSA's: one of them is 
strongly connected, another one has a sink state j4l . Therefore, some refinement 
of the notion of MSA seems to be necessary. Another important observation is 
the following: minimal synchronizing automata for the aforementioned languages 
L„ are strongly connected. Thus, one may expect that there is always a strongly 
connected MSA for an ideal language. In the present paper we show that it is not 
the case. Moreover, the smallest strongly connected automaton with a language 
L as the language of synchronizing words may be exponentially larger than a 
minimal synchronizing automaton of L. 

Another source of motivation for studying representations of ideal languages 
by means of synchronizing automata comes from the famous Cerny conjecture. 
Cerny already in 1964 conjectured that every synchronizing automaton possesses 
a synchronizing word of length at most {n — 1)^. Despite intensive efforts of re- 
searchers this conjecture is still widely open. We can restate the Cerny conjecture 
in terms of reset complexity as follows: if £ is the minimal length of words in 
an ideal language L then rc{L) > y/1 -\- 1. Thus, we hope that deeper under- 
standing of reset complexity will bring us new ideas to resolve this long standing 
conjecture. It is well known that the Cerny conjecture holds true whenever it 
holds true for strongly connected automata. In this regard an interesting related 
question was posed in [2 : does every ideal language serve as the language of syn- 
chronizing words for some strongly connected automaton? For instance, if the 
answer is negative then there is a way to simplify formal language statement of 
the Cerny conjecture. Unfortunately, it is not the case. Recently Reiss and Ro- 
daro [6] for every ideal languageE3 L presented a strongly connected automaton 

such that Syn{£/) = L. Their proof is non-trivial and technical. In the present 
paper we give simple constructive proof of the fact that every finitely generated 
ideal language L, i.e. L = S*US* for some finite set U , serves as the language 
of synchronizing words of some strongly connected automaton. Our construc- 
tions reveal interesting connections with classical objects from combinatorics on 
words. 

2 Algorithms and automata constructions 

Let E he a, finite alphabet with \S\ > 1. Let i be a finitely generated ideal 
language over E, i.e. L = S*SS*, where 5* is a finite set of words. In this 
section we construct a strongly connected synchronizing automaton for which 
L = S*SS*. 

First recall some standard definitions and fix notation. A word u is a factor 
{prefix^ suffix) of a word w, if w = xuy [w — uy, w = xu respectively) for some 
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x,y ^ S*. By Fact{w) we denote the set of all factors of w. The i*'* letter of the 
word w is denoted by w[i\. The factor it;[i]w[i + 1] • • • w[j] is denoted by w[i..j]. 
By Z"" {S-"^, S-"^) we denote the set of all words over S of length n (at most 
n, at least n respectively). 

Note, that if a word s e S" is a factor of some other word t d S, then the word 
t may be deleted from the set S without affecting the ideal language, generated 
by S. Thus, we may assume, that the set S is anti-factorial, i.e. no word in S is 
a factor of another word in S. 

2.1 Ideal language generated by 17" 

Theorem 1. Let S ~ {a,b}. There is unique up to isomorphism strongly con- 
nected synchronizing automaton such that Syn(^) — Z"-". 

Proof. Consider De Bruijn graph for the words of length n. Recall that the 
vertices of this graph are the words of length n, and there is a directed edge 
from the vertex u to the vertex v, if u — xs and v — sy for some s E S"^^, 
x,y G E. By labeling each edge e = (u, v) by the last letter of v we obtain De 
Bruijn automaton. Its state set is Q = Z", and transition function is defined in 
the following way: xs .y = sy for s € Z"""^, x,y £ S. De Bruijn automaton is 
known to be strongly connected. Thus it remains to verify that Syn(^) — L. 
It is easy to see that for an arbitrary word u of length at most n we have 
Q .u = Hence for any word w of length n we have |Q . w| — 1, and for 

any word u of length less than n we have \Q .u\ > 1. So, Syn(^) = L. 

Let — {Q, E, S) be a strongly connected synchronizing DFA such that 
Syn(^) = L. Let us prove that \Q\ < 2". Strong connectivity implies Q .aU 
Q .b — Q. By induction it is easy to see that Q = lj|ui|=fe Q -w.lii particular, we 

have g = U|™|=„ Q • ^- Thus, |Q| = | U|»|=„ Q • < E|.|=„ \Q-w\^ 2". The 
last equality follows from the fact that every word of length n synchronizes so 
each Q .w is a singleton. For the converse inequality 2" < \Q\ consider the DFA 
^a, obtained from by removing all transitions corresponding to the action of 
b in The word a" synchronizes so "T^a contains no cycles but unique loop. 
So the automaton has a tree-like structure as it is shown on FigH] Denote 
by s the state of such that s .a = s. The state s is called root of the tree, and 
the states pi, P2, ■ ■ ■ , Pk having no incoming transitions labeled by a are called 
leaves of the tree. The height h{pi) of a vertex pi is the length of the path from 
Pi to the root s. The height of the tree h{'^a) is the maximal height of its leaves. 
We have h{'^a) = n. Indeed, if h{'^a) = h < n, then we would have Q .a^ = {s}, 
meaning that a** G Syn(^), which is impossible. 

Consider the set of leaves H = Q\Q .a — {pi,p2, ...,pk}- Since the DFA 
is strongly connected, for each state pg in H there exists a state qe such that 
q^.b = pe. Thus H C Q .b. We show that H is exactly Q.b, meaning that 
Q .a D Q .b = 0. Take a leaf of height n. Without loss of generality suppose 
it is pi. Let qi be such that qi-b = pi. The word ba"^~^ is synchronizing, so 
Q .ba^^~^ = {q} for some q € Q. We have qi .6a"~^ = q, and q.a = s (see 
Fig[T]). Suppose there ispGQ. aCiQ . b. Then there is a state q such that q.b = p. 
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Fig. 1. The action of a in 

Since p is not a leaf, we have h{p) < n. Then q.ba"'~^ = p.a"~^ = s ^ q. A 
contradiction. Hence H = Q .b. Furthermore, the height of any leaf of 'i^a is 
exactly n. To see this assume that there exists a state Pm such that h{pm) < n, 
i.e. p„i . = s, for some £ < n. Then the word ba^~^ is not synchronizing. Indeed, 
take a state Qm such that qm-b = Pm- We have qm ■ ba^~^ = Pm ■ a"~^ = s ^ q. 

Consider an arbitrary state p G Q.a. Let 5~^{p,u) = {p' €: Q \ p' .u = 
p}. We prove that |(5~"'^(p, a)| > 2 for each p G Q.a. For the root s wc have 
{s,q} C 5~^{s,a), thus, |(5~^(s,a)| > 2. Let p be an arbitrary state in Q.a. 
Strong connectivity of implies that there exists a state p and a word w e Z"" 
such that p . w = p. Since w is synchronizing, we have Q .w = {p}. Consider the 
word w[l..n— 1] that does not synchronize 'to. Then \Q . i^ll-.n— 1]| > 2. However, 
{Q . w[l..n — 1]) .w{n] = p. And we obtain the inequality a)| > 2. Denote 

Ho = {q} and construct sets iJ^ = S^^ {Hi-i, a) for 1 < i < n — 1. Wchave \Hi\ > 
2^ for ah 1 < i < n - 1. Then possesses at least 1 + 1 + 2 + 4+ ... + 2"-i = 2" 
states. 

Thus wc have \Q\ = 2". Moreover, Q = U|tu|=„Q . w. It means that with each 
state g of (5 we can associate the word w of length n such that Q .w = {q}. It 
is clear that it gives us the desired isomorphism between ^ and ^. 

□ 

Remark 1. In case S = {a, 6}UZ\, where Z\ 7^ 0, we consider De Bruijn automa- 
ton constructed for the binary alphabet {a, 6} and put the action of each letter 

in A to be the same as the action of the letter a. It is clear that the language of 
synchronizing words of the modified De Bruijn automaton coincides with 17-". 

The Proposition implies that the minimal DFA recognizing an ideal lan- 
guage L can be exponentially smaller than a strongly connected MSA ^ with 
Syn{^) = L. 



Finitely generated ideal languages and synchronizing automata 



5 



2.2 Ideal language generated by a set of words of fixed length 

Theorem 2. Let U C There is a strongly connected synchronizing automa- 
ton S^u with 2" states such that Syn(^[/) = S*US*. 

Proof. We modify the De Bruijn automaton ^ from the section 2.1 to obtain 
the desired automaton ^^jj- First of all it is convenient to view the states of the 
automaton IJ§ not as the words of length n, but as pairs (x, u), where x Cz S and 
u G S"^^. Then by the definition of the transitions in ^ we have 

{x, u) A (z, v) uy = zv (1) 

For a word uy which is not in U , we modify the corresponding transition given 
by (IlJ in the following way. li uy ^ [/ U {a", 6"} we put 

ix,u)^{x,v), (2) 

where v is defined by (P). 

li uy ^ a"- ^ U {uy = h'^ respectively) we put 

(a, a"~i) A (6, a"-i), ((6, 6"-^) A (a, respectively). (3) 

The other transitions remain unchanged. The obtained automaton is denoted 



a 




Fig. 2. De Bruijn automaton for n = 3 



by ^jj. The examples of the automaton S§ and the corresponding modified 
automaton 3§u for U = {aaa, abb, hah} are shown on FigHJand Fig|3]respectively. 
We prove that the automaton satisfies the statement of the proposition. First 
we show that is strongly connected. For this purpose we prove that all the 
states are reachable from the state (a, a"""'^), and the state (a, a"~^) is reachable 
from all states. 




Fig. 3. Automaton SBv for U = {aaa, abb, bab} 



First we show that a state (a, u) is reachable from (a, a"'~^) for any u G 
If u = a"~^, the claim obviously holds. Hence we may assume u = a'^bu, where 
k > 0, u G By the definition of transitions in we have 

(a, a"-i) \ {a, a^-'b) ^ {a, a--'bu[l]) ^ • • • ^tz^ 

{a, a''+Hu[l..n - fc - 3]) "'""''"^'j (a, a'=6u[l..n - fc - 2]) = (a, u). 

Symmetrically any state {b,u) is reachable from the state (6, 6"~^). The latter 
state is reachable from (a, 6"~^). Thus the state {b,u) is reachable also from 
(a,a"-i): 

Now we show that the state (a,a"^^) is reachable from any other state. 
Apply the word a"~^ to an arbitrary state {x, u). By the definition of transitions 
we have {x,u).a"-^ G {(a, a"-^), (&, a""!)}. If (x,w).a"-^ = (a,a""^) we are 
done. If (x, u) . a"~^ = {b, a"~^), then we apply once more the letter a and obtain 
.a" = (a,a"-i). 

Thus the constructed automaton is strongly connected. Next wc show 
that Syn(^t/) = X!*UX!*. It is easy to see that for any word u € Z""^^ wc have 
Q .u C {(a, u), (6, m)}, and Q .uf] Q .v ~ for u, w G U"^^ such that u ^ v. 
Thus Q D U Q . w. Next we check that Q = U Q . w. Indeed, if a" e (7 

|w|=n— 1 |u|=n — 1 

we have (a, a""-^) A (a, u) for all u G 17""^. li a" ^ U take any word u G 17"^^ 
If u = a"~^ then u maps the state (a, a"~^) or the state (6, a"~^) to {a,u). 
Let us assume now that u = a'^bu. If k is even (odd, respectively) then u maps 
(a.n"^^) ((6,0,"^-'^). respectively) to {a,u). So any states {a,u) belongs to the 
set IJ Q . u. Symmetrically any states (6, u) belongs to the latter set. Hence 

|u|=n— 1 

Q = U Q -U- Since \Q\ = 2", if there is a synchronizing word u of length 

\u\=n—l 
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n — 1, we would have 2"' = |(5| = | |J Q.u|<2", which is a contradiction. 

\u\—n — l 

Thus, none of the words of length n — 1 is synchronizing. Consider an arbitrary 
word w of length n and factorize it as w = uy with u G E"~^ and y G S. We 
have Q .u = {{a,u), {b,u)}. If w G ?7, then the corresponding transitions from 
the states (a, u) and {b,u) were not changed, and we have Q .uy = {{z,v)}, 
where uy = zv, so w is synchronizing. If w ^ U, then Q .uy = {{a,v), {b,v)}, 
where v is such that uy = zv for some z G E, so w ^ Syn{^u). 

□ 

2.3 Ideal languages generated by a finite set of words 

Theorem 3. Let S be finite and anti-factorial set of words in . There is a 
strongly connected synchronizing automaton ^5 such that Syn('^5) = S*SS* . 
This automaton has at most 2" states, where n = max{|s| | s e 5}. 

Proof. Let T = {w £ \ 3s G S*, s G Fact{w)}. First we construct the au- 
tomaton as described in the previous proposition. In that proposition the 
states of were viewed as pairs (x, u) with x G S, u G E"'~^. Here it will be 
convenient to view the states as the words xu of length n (as it was in the initial 
De Bruijn automaton). Note, that since S is anti- factorial, every state in T can 
be uniquely factorized as usv such that s G S, u,v G S* and sv does not contain 
factors in S except s. In what follows we will use this unique representation 
without stating it explicitly. 

Next we define an equivalence relation ~ on the set of states of this automaton 
(i.e. on words of length n) in the following way. Let w, w' G T. We have w ~ w' 
iS w = usv and w' = u'sv, where s G S, u,u',v G S* . On the set Z"" \ T the 
relation ~ is defined trivially, i.e. for w, w' G S"' \ T we have w ~ w' iff w; = w'. 
It is easy to see that ~ is indeed an equivalence relation on In fact, ~ is 
a congruence on the set of states of the automaton ^t- Let us check that for 
any x G S and any w, w' G i7" w c:i w' implies w . x ~ w' . x. liw,w' G \ T, 
then w = w' and we are done, if u>,u>' G T, then w = usv, w' = u'sv. If 
u = u' = e, then w = w', and there is nothing to prove. So we may assume, 
that u,u' 7^ e. Then usv.x = tsvx and u'sv.x = t'svx for some t,t' G S*. 
Since the obtained two words have the same suffixes, containing a word in S, 
they are equivalent. So we can consider the factor automaton ^t/ — , whose 
states are the equivalence classes of ~, and the transition function is induced 
from the initial automaton. Let us denote by [sv] the equivalence class of a word 
usv G T, and by [u] the equivalence class of a word u ^ T. We claim, that 
■^5 = ^t/ — • In other words, the constructed automaton is strongly connected, 
and Syn(,f^T/ — ) = E*SS*. The first property holds trivially, since a factor 
automaton of a strongly connected automaton is strongly connected. 

For any w G S* and s G S previously in we had w .s = us, where 
u G S* . Since S is anti- factorial, in I^t/ — wc have [w] . s = [s], so any s G S' is 
synchronizing for the automaton — • Now let t be a synchronizing word, so 
there is a state [w] such that for any state [w'] we have [w'] .t — [w]. If [w] is a 
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one-element class, then the word t was synchronizing for the initial automaton 
so t contains some word in S" as a factor, i.e. t e S*SX!*. Consider the case 
where [w] is a class consisting of elements uisv, U2sv, . . . ,UkSV, k > 1. Note 
that in this case Ui ^ e for each i = 1, . . . , fc. This means that t = usv for some 
u e S*, thus, also in this case t e S*Si:*. 

□ 

Complete this section with an example. Let S = {a^,a&a} and E = {a,b}. 
Construct the corresponding set T — {a^,a'^b,ba-^,aba}. Next build the DFA 
^T- The resulting automaton is shown on the left side of FiglH 




Fig. 4. Automata and ,S§t/ — for T = {aaa, aab, baa, aba} 



By the definition of ~ the class [aa] includes states aaa and baa. The rest 
classes are one-element. The resulting automaton ^^t/ — is shown on the right 
side of FiglH 

2.4 Ideal languages generated by two words 

Let S = {u, v} C 17+ and let |u| = n, \v\ — m. Again we suppose that S is anti- 
factorial. In this case we can construct a strongly connected automaton i^u,v 
such that Syn(i^,j„) = S* (u + v)S* with n + ni states, thus improving construc- 
tion from the previous section. For simplicity we state and prove the following 
theorem only for the case of binary alphabet, although the same argument works 
in general. 

Theorem 4. Let S = {a,b}, and letuE Z"" \ {a6"-i, a"-i5, 6a"~\ 6"-ia}, v G 
E"^\{ab™~^ , a^^^b, ba"^^^, 6™^^a}. There is a strongly connected synchronizing 
automaton &u,v having n + m states such that Syn(i^,i.^) = I]*{u + v)S* . 

Proof. In order to obtain ^^u,v we combine minimal automata for the languages 
E*uE* and E*vS* . For a letter x G {a, 6} by x we denote its complementary 
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letter, i.e. a ^ b, and b — a. Recall the construction of the minimal automaton 
recognizing the language S*wS*, where w E ^7+. It is well-known that this 
automaton has |w| + 1 states. We enumerate the states of this automaton by the 
prefixes of the word w so that the state maps to the state under 

the action of the letter w[i + 1] for all i, < i < k. The other letter + 1] 
sends the state w;[l..z] to state p such that p is the maximal prefix of w that 
appears in w[l..i + 1] as a suffix. The state w is the sink state of the automaton. 
The initial state is e and the unique final state is w, see FigO (the transitions 
labeled by complementary letters w[i\ are not shown). 



6 a,b 




Fig. 5. The minimal DFA j^^ 



Construct minimal automata jz/^ and £/y. Denote by ^ the automaton ob- 
tained from by deleting the sink state and the transition from u[l..n — 1] 
labeled by u[n]. Denote by jz/^,' the corresponding automaton for v. Define the 
action of letters u[n\ and v[m\ on states u[l..n — 1] and v[l..m — 1] as follows. 
Denote by p the state in ^/^ corresponding to the maximal prefix of u that 
appears in w as a suffix. Denote by s the state in corresponding to the max- 
imal prefix of V that appears in u as a suffix. We put u[l..n — 1] . u[n\ = s and 
vll-.m — 1] .vim] = p. Denote the resulting automaton by &u,v and prove that 
it satisfies the desired properties. Figures I6I7I8I illustrate the construction for 
u = abaab and v — babab. 




b 



Fig. 6. The minimal DFA recognizing E* abaabS* . 




a 

Fig. 7. The minimal DFA recognizing E*bababE* . 
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Fig. 8. The DFA &u,v 



The following claim is rather easy to see. The explicit proof can be found in [3] . 

Claim. If w G S" \ {a"^^&, a6"^^}, then the automaton is strongly con- 
nected. 

By the Claim automata s:/^ and s/y are strongly connected. By the definition 
of the action of letters u[n\ and v[m\ on states u[l..n — 1] and v[l..m — 1], the 
resulting automaton S'u,v is also strongly connected. 

Now we are going to verify that u,v £ Syn(^„^i,). The state set of &u,v is the 
union of the state set of (denoted by Q"), and the state set of jz/J (denoted 
by Q"). To avoid confusion when necessary we will use the upper indices u and v 
for the states in Q" and respectively. Let w be an arbitrary word. We claim 
that .w = r, where r is the maximal prefix of either u or w which is a sufhx 
of w. Let us consider the path from e" to r. As long as we do not use modified 
transitions, i.e. the ones that lead from Q" to Q" or vice versa, the claim holds 
true by the definition of jz/^^ and Suppose now that the path contains a 

transition u[l..n — 1] — ^ s and .w' = u[l..n ~ 1], where w' is a prefix of 
w. Let s' be the maximal prefix of the word v which is a suffix of w'u[n\. Note 
that w'u[n\ has u as a suffix. Therefore \s'\ < \u\, otherwise u is a factor of v. 
Since |s'| < |m| we have s' — s. Similar reasoning applies in case of transition 



^ p. Therefore, the claim holds true. It is not hard to see that 
where r' is the maximal prefix of either w or w which is a suffix 



v[l..m — 1] 
also e'" .w - 
of w. 

Now we are ready to show that u is a synchronizing word for &u,v By the 
definition of ^u,v we have e" . u = s. Let us consider an arbitrary state t £ Q". 
Let .tu = r. Note that the maximal prefix of the word u which is a suffix of tu 
is equal to u. Then by the claim r is a prefix of v. Since u is not a factor of v we 
have \r\ < \u\. Thus, r — s due to maximality. Since .t = t we have t .u = s. 
Thus, .u = {s}. Arguing in the same way for the state we get .u — {s}. 
So, u is synchronizing. Analogously, one can show that v is synchronizing. 
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To complete the proof it remains to verify that each word from the set 
Syn{!^u,v) contains m or w as a factor. Take w £ Syn(i^,j.„) and a state r e Q". 
li r.uu G Q", then w maps all states in the component into the same 
state. In particular, e'" .w € Q". Thus v appears in w as a factor. Analo- 
gously if r . w g Q", the word u appears as a factor in w. So we proved that 
Syii{&u,v) ^ S*{u + v)Z:*. □ 

Acknowledgement. The authors acknowledge support from the Presiden- 
tial Programm for young researchers, grant MK-266. 2012.1. 
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